TopChurn: Maximum Entropy Churn Prediction Using Topic Models Over Heterogeneous Signals
نویسندگان
چکیده
With the onset of social media and news aggregators on the Web, the newspaper industry is faced with a declining subscriber base. In order to retain customers both on-line and in print, it is therefore critical to predict and mitigate customer churn. Newspapers typically have heterogeneous sources of valuable data: circulation data, customer subscription information, news content, and search click log data. An ensemble of predictive models over multiple sources faces unique challenges – ascertaining short-term versus long-term effects of features on churn, and determining mutual information properties across multiple data sources. We present TopChurn, a novel system that uses topic models [5, 29, 24] as a means of extracting dominant features from user complaints and Web data for churn prediction. TopChurn uses a maximum entropy-based approach [21] to identify features that are most indicative of subscribers likely to drop subscription within a specified period of time. We conduct temporal analyses to determine long-term versus short-term effects of status changes on subscriber accounts, included in our temporal models of churn; and topic and sentiment analyses on news and clicklogs, included in our Web models of churn. We then validate our insights via experiments over real data from The Columbus Dispatch, a mainstream daily newspaper, and demonstrate that our churn models significantly outperform baselines for various prediction windows.
منابع مشابه
A New Method for Detection of Backscattered Signals from Breast Cancer Tumors: Hypothesis Testing Using an Adaptive Entropy-Based Decision Function
Introduction In recent years methods based on radio frequency waves have been used for detecting breast cancer. Using theses waves leads to better results in early detection of breast cancer comparing with conventional mammography which has been used during several years. Materials and Methods In this paper, a new method is introduced for detection of backscattered signals which are received by...
متن کاملHierarchical Alpha-cut Fuzzy C-means, Fuzzy ARTMAP and Cox Regression Model for Customer Churn Prediction
As customers are the main asset of any organization, customer churn management is becoming a major task for organizations to retain their valuable customers. In the previous studies, the applicability and efficiency of hierarchical data mining techniques for churn prediction by combining two or more techniques have been proved to provide better performances than many single techniques over a nu...
متن کاملChurn Analysis in a Music Streaming Service Predicting and understanding retention GUILHERME DINIS CHALIANE JUNIOR KTH ROYAL INSTITUTE OF TECHNOLOGY SCHOOL OF INFORMATION AND COMMUNICATION TECHNOLOGY Churn Analysis in a Music Streaming Service Predicting and understanding retention
Churn analysis can be understood as a problem of predicting and understanding abandonment of use of a product or service. Di erent industries ranging from entertainment to financial investment, and cloud providers make use of digital platforms where their users access their product offerings. Usage often leads to behavioural trails being left behind. These trails can then be mined to understand...
متن کاملProfit Driven Decision Trees for Churn Prediction
Customer retention campaigns increasingly rely on predictive models to detect potential churners in a vast customer base. From the perspective of machine learning, the task of predicting customer churn can be presented as a binary classification problem. Using data on historic behavior, classification algorithms are built with the purpose of accurately predicting the probability of a customer d...
متن کاملPrediction of daily precipitation of Sardasht Station using lazy algorithms and tree models
Due to the heterogeneous distribution of precipitation, predicting its occurrence is one of the primary and basic solutions to prevent possible disasters and damages caused by them. Considering the high amount of precipitation in Sardasht County, the people of this city turning to agriculture in recent years and not using classification models in the studied station, it is necessary to predict ...
متن کامل